Performance data mining: Automated diagnosis, adaption, and optimization

نویسندگان

  • Alois Ferscha
  • Allen D. Malony
چکیده

The development of performance measurement and analysis techniques and tools for high-performance parallel and distributed systems has made it possible to capture a wealth of data about application and system performance behavior. These data embody the effects of interacting, performance factors found in the program, its algorithms, the architecture and hardware, and the system software, whose interdependent performance relationships grow ever more complex as the computing environment increases in sophistication. Nevertheless, the user is still, for the most part, placed in the central decision-making role in the use of the techniques/tools, the interpretation of the resulting performance information, and the guidance for program or system modification. Recent work has sought to move human decisionmaking out of the performance measurement–diagnosis–optimization loop by employing “intelligent” methods based on automated performance measurement, knowledge-based diagnosis frameworks, online, adaptive performance control, and predictive performance models built from detailed empirical analysis. The term “performance data mining” is used to characterize this work. Already back in 1997 in association with the International Conference on Supercomputing, we organized a workshop on performance data mining for high-performance parallel and distributed systems. To the best of our knowledge, this was the first scientific event dedicated to the exploitation of data mining techniques in the field of performance evaluation. The papers enclosed in this volume have their roots in this pioneering workshop. The topics discussed reflect different perspectives on the performance data mining research question and on the focus for effective strategies and techniques. In the following sections, we first discuss our views on the concept of performance data mining. We then introduce each paper and comment on how we see the ideas presented contributing to the problem understanding and solution. We conclude with thoughts for future research.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automated detection of coronavirus disease (COVID-19) by using data-mining techniques: a brief report

Background: The clinical field has vast sick data that has not been analyzed. Discovering a way to analyze this raw data and turn it into an information treasure can save many lives. Using data mining methods is an efficient way to analyze this large amount of raw data. It can predict the future with accurate knowledge of the past, providing new insights into disease diagnosis and prevention. S...

متن کامل

A Data Mining approach for forecasting failure root causes: A case study in an Automated Teller Machine (ATM) manufacturing company

Based on the findings of Massachusetts Institute of Technology, organizations’ data double every five years. However, the rate of using data is 0.3. Nowadays, data mining tools have greatly facilitated the process of knowledge extraction from a welter of data. This paper presents a hybrid model using data gathered from an ATM manufacturing company. The steps of the research are based on CRISP-D...

متن کامل

S3PSO: Students’ Performance Prediction Based on Particle Swarm Optimization

Nowadays, new methods are required to take advantage of the rich and extensive gold mine of data given the vast content of data particularly created by educational systems. Data mining algorithms have been used in educational systems especially e-learning systems due to the broad usage of these systems. Providing a model to predict final student results in educational course is a reason for usi...

متن کامل

Prediction and Diagnosis of Diabetes Mellitus using a Water Wave Optimization Algorithm

Data mining is an appropriate way to discover information and hidden patterns in large amounts of data, where the hidden patterns cannot be easily discovered in normal ways. One of the most interesting applications of data mining is the discovery of diseases and disease patterns through investigating patients' records. Early diagnosis of diabetes can reduce the effects of this devastating disea...

متن کامل

Diagnosis of diabetes by using a data mining method based on native data

Background & Aim: Detecting the abnormal performance of diabetes and subsequently getting proper treatment can reduce the mortality associated with the disease. Also, timely diagnosis will result in irreversible complications for the patient. The aim of this study was to determine the status of diabetes mellitus using data mining techniques. Methods: This is an analytical study and its databas...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Future Generation Comp. Syst.

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2001